Webscraping and Data Visualisation

Webdata 1 - List of S&P 500 companies

The Standard & Poor's 500 stock market index is maintained by S&P Dow Jones Indices. It comprises 503 common stocks issued by 500 large-cap companies traded on American stock exchanges. The index covers about 80 percent of the American equity market by capitalization. It is weighted by free-float market capitalization, so more valuable companies account for relatively more weight in the index. The data comes from https://en.wikipedia.org/wiki/List_of_S%26P_500_companies and we are going to scrape it using Beautiful Soup.
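As a minimal sketch of the scraping step, the snippet below parses an HTML table into a list of dictionaries with Beautiful Soup. The helper name `parse_first_table` and the `SAMPLE_HTML` snippet are hypothetical. The sample rows are made-up placeholders, not real index data. It also assumes the table of interest is the first `<table>` on the page, which may not hold if the page layout changes.

```python
from bs4 import BeautifulSoup

WIKI_URL = "https://en.wikipedia.org/wiki/List_of_S%26P_500_companies"


def parse_first_table(html):
    """Return the rows of the first <table> in `html` as a list of dicts.

    Hypothetical helper: the first table row is treated as the header.
    """
    soup = BeautifulSoup(html, "html.parser")
    table = soup.find("table")
    if table is None:
        return []
    rows = table.find_all("tr")
    if not rows:
        return []
    headers = [cell.get_text(strip=True) for cell in rows[0].find_all(["th", "td"])]
    records = []
    for row in rows[1:]:
        cells = [cell.get_text(strip=True) for cell in row.find_all(["th", "td"])]
        # Skip rows whose cell count does not match the header (e.g. merged cells).
        if len(cells) == len(headers):
            records.append(dict(zip(headers, cells)))
    return records


# Made-up stand-in for the downloaded page, so the sketch runs offline.
SAMPLE_HTML = """
<table>
  <tr><th>Symbol</th><th>Security</th><th>GICS Sector</th><th>Date added</th></tr>
  <tr><td>AAA</td><td>Example Corp A</td><td>Information Technology</td><td>2001-01-01</td></tr>
  <tr><td>BBB</td><td>Example Corp B</td><td>Health Care</td><td>2010-06-15</td></tr>
</table>
"""

sample_rows = parse_first_table(SAMPLE_HTML)
print(sample_rows[0]["Symbol"], sample_rows[0]["GICS Sector"])
```

To run this against the live page, the HTML could be downloaded first (for example with `requests.get(WIKI_URL).text`) and passed to `parse_first_table` in place of `SAMPLE_HTML`.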

As observed, most of the companies in the list fall under the Information Technology sector. The Plotly chart below shows how the companies are split across sectors, together with the date each one was added.

Webdata 2 - List of highest-grossing films in the United States and Canada

This is a list of the highest-grossing films in the U.S. and Canada, a market known in the film industry as the North American box office, or as the domestic box office within the U.S. itself. The chart is ranked by lifetime gross; a film's earnings from its initial release are also included to provide a basis for comparison between films released around the same time. Reference Wikipedia link: https://en.m.wikipedia.org/wiki/List_of_highest-grossing_films_in_the_United_States_and_Canada#

We are going to clean the data in each column.
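A minimal sketch of what this cleaning might look like is shown here, using only the standard library. The helper names `parse_money` and `parse_year`, the column names, and the sample row are all hypothetical. It assumes gross values are whole-number amounts written with a currency sign and thousands separators, and that cells may carry footnote markers such as `[1]`.

```python
import re


def parse_money(text):
    """Convert a string such as '$1,234,567[2]' to the integer 1234567.

    Assumes whole-number amounts; returns None when no digits are present.
    """
    without_notes = re.sub(r"\[[^\]]*\]", "", text)  # drop footnote markers like [2]
    digits = re.sub(r"\D", "", without_notes)        # keep digits only
    return int(digits) if digits else None


def parse_year(text):
    """Extract the first four-digit year from a cell such as '1999[a]'."""
    match = re.search(r"\b(?:18|19|20)\d{2}\b", text)
    return int(match.group(0)) if match else None


# Made-up raw row with hypothetical column names, for illustration only.
raw_row = {
    "Title": "  Example Film ",
    "Lifetime gross": "$1,234,567[2]",
    "Year": "1999[a]",
}

clean_row = {
    "Title": raw_row["Title"].strip(),
    "Lifetime gross": parse_money(raw_row["Lifetime gross"]),
    "Year": parse_year(raw_row["Year"]),
}
print(clean_row)
```

Converting the gross columns to integers and the year column to a number is what makes sorting and plotting on those columns possible later.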

We can observe from the plots that, out of the 200 highest-grossing movies, 4 have reached 1 billion in lifetime gross, and Star Wars has both a high initial gross and a high lifetime gross.

Python widgets for dataset2